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The  goal  of  the  research  was  to  understand  the  role  of  contextual  constraint  in  the  visual  analysis  of  complex,  natural  scenes.  This 
issue  bears  directly  on  the  basic  architecture  of  the  human  visual  system,  and  has  implications  for  the  design  of  artificial  vision 
systems  devoted  to  object  and  scene  analysis.  Two  hypotheses  were  considered:  (1)  Scene  constraint  influences  the  perceptual 
identification  of  individual  objects;  (2)  Scene  constraint  influences  only  post-identification  object  analysis.  The  main  results  of  the 
research  support  the  first  hypothesis:  Scene  constraint  does  not  directly  influence  perceptual  analysis  of  component  objects  in  human 
vision.  Other  results  from  the  research  supports  the  conclusion  that  semantic  information  is  not  used  to  drive  initial  eye  movements  in 
a  scene,  but  does  influence  initial  fixation  time  in  a  region  and  region  refixation  probability.  Based  on  these  results,  a  model  of  scene 
analysis  was  developed  in  which  object  identification  is  functionally  isolated  from  scene  meaning  and  gaze  control  is  initially 
independent  of  scene  semantics  but  becomes  sensitive  to  meaning  as  scene  perception  unfolds  over  time.  Continuing  work  is 
currently  aimed  at  testing  this  model  in  human  vision  and  gaze  control,  and  implementing  an  artificial  gaze  control  system  on  a  working 
robot  platform  using  a  Markov  Decision  Process  framework. 

14.  SUBJECT  TERMS 

scene  perception,  object  identification,  visual  cognition,  vision,  eye  movements,  gaze  control 

15.  NUMBER  OF  PAGES 

4 

16.  PRICE  CODE 

17.  SECURITY  CLASSIFICATION  18.  SECURITY  CLASSIFICATION  19.  SECURITY  CLASSIFICATION 

OR  REPORT  ON  THIS  PAGE  OF  ABSTRACT 

UNCLASSIFIED  UNCLASSIFIED  UNCLASSIFIED 

20.  LIMITATION  OF  ABSTRACT 

UL 

NSN  7540-01-280-5500  Standard  Form  298  (Rev.2-89) 


Prescribed  by  ANSI  Std.  239-18 


298-102 


19991215  031 


r 


REPORT  DOCUMENTATION  PAGE  (SF298) 
(Continuation  Sheet) 


The  main  results  from  the  research  conducted  under  this  grant  are  the  following: 

o  Prior  demonstrations  of  facilitated  performance  for  objects  that  are  semantically  consistent  with  their  scenes 
appear  to  be  compromised  by  a  confound  in  the  nature  of  the  control  stimulus  used  to  compute  measures  of  perceptual 
sensitivity  (Hollingworth  &  Henderson,  1997b).  Instead,  initial  perceptual  encoding  is  facilitated  for  an  object  that  is 
semantically  inconsistent  with  its  scene  compared  to  an  object  that  is  semantically  consistent  (Hollingworth 
&Henderson,  1998,  1999).  This  finding  is  contrary  to  models  of  object  and  scene  perception  that  assume  a  heavily  top- 
down  architecture,  but  supports  models  that  assume  “functional  isolation”  of  object  perception,  consistent  with  a 
modular  visual  architecture. 

o  The  placement  of  initial  fixations  during  real-world  scene  viewing  is  controlled  by  visual  rather  than  semantic 
features,  while  fixation  durations  and  later  fixation  placements  are  controlled  by  both  visual  and  semantic  features 
(Henderson  &  Hollingworth,  1998,  1999;  Henderson,  Weeks,  &  Hollingworth,  1999).  Again,  this  result  is  consistent 
with  limited  interaction  at  early  stages  of  scene  perception. 

o  Based  on  the  above  findings,  as  well  as  other  findings  in  the  eye  movement  literature,  we  have  developed  a 
gaze  control  model  based  on  a  visual  saliency  map(Henderson  &  Hollingworth,  1998).  We  are  currently  implementing 
this  model  on  a  robot  platform  using  a  Markov  Decision  Process  formalism. 

o  Covert  visual-spatial  attention  facilitates  the  encoding  of  perceptual  information,  in  addition  to  any  effects  it 
may  have  on  the  reduction  of  visual  noise  later  in  the  information  processing  system  (Henderson,  1996). 

o  The  facilitated  perceptual  encoding  of  semantically  inconsistent  objects  in  scenes  is  not  due  to  the  attraction  of 
-  covert  visual-spatial  attention  (Hollingworth  &  Henderson,  1997a).  This  finding  is  consistent  with  our  eye  movement 
data  showing  that  covert  attention  is  not  attracted  to  these  objects  (Henderson,  Weeks,  &  Hollingworth,  1997),  and 
suggests  that  the  engagement  of  attention  is  unaffected  by  semantic  consistency  (Henderson  &  Hollingworth,  1998). 

o  Foveal  analysis  is  not  necessary  for  object  identification  as  long  as  compensatory  time  is  available  for 
parafoveal  analysis  (Henderson,  McClure,  Pierce,  &  Schrock,  1997).  This  finding  contrasts  with  reading,  in  which 
foveal  analysis  is  necessary  for  word  identification.  It  will  be  necessary  to  determine  whether  this  finding  generalizes  to 
full  scenes. 

o  The  retention  and  integration  of  information  about  meaningful  objects  and  scenes  does  not  rely  on  a  veridical 
representation  of  object  contour  or  object  position  (Henderson,  1997;  Hollingworth  &  Henderson,  2000).  These  results 
are  consistent  with  the  view  that  only  abstract  representations  of  complex  real-world  stimuli  are  retained  and  integrated 
across  eye  movements. 
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